Corpus: spa-ad_web_2017_30K

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word N-grams (N = 1 to 5) at sentence endings in the first K sentences

      K  # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
    100          69            71             72            73            86
   1000         788           865            890           896           950
  10000        5662          7958           8749          9021          9416
 100000       12482         20708          24481         25772         26947
1000000       12482         20708          24481         25772         26947
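
The counts above could be reproduced along the following lines. This is only a minimal sketch under stated assumptions, not the tool that generated this report: it assumes that "number" means the number of distinct N-grams, that an N-gram at a sentence ending is the last N whitespace-separated tokens of a sentence, and that sentences shorter than N tokens are skipped. The function name `ending_ngram_counts` and its parameters are hypothetical.

```python
from typing import Dict, Iterable, Set, Tuple


def ending_ngram_counts(sentences: Iterable[str], k: int, max_n: int = 5) -> Dict[int, int]:
    """Count distinct sentence-final word N-grams (N = 1..max_n) in the first k sentences.

    Assumption: tokens are whitespace-separated; the real report may tokenize differently.
    """
    seen: Dict[int, Set[Tuple[str, ...]]] = {n: set() for n in range(1, max_n + 1)}
    for index, sentence in enumerate(sentences):
        if index >= k:
            break  # only the first k sentences are considered
        tokens = sentence.split()
        for n in range(1, max_n + 1):
            if len(tokens) >= n:  # skip sentences too short to hold an N-gram
                seen[n].add(tuple(tokens[-n:]))
    return {n: len(ngrams) for n, ngrams in seen.items()}


if __name__ == "__main__":
    # Toy input, not taken from the corpus.
    sample = ["el gato duerme .", "el perro duerme .", "la casa ."]
    for k in (1, 3, 10):
        print(k, ending_ngram_counts(sample, k))
```

In this sketch, a K larger than the number of available sentences simply uses all of them. That may be why the rows for K = 100000 and K = 1000000 in the table are identical, though the report itself does not say so.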


[Figure: Zipf's diagram for sentence endings (Gnuplot diagram)]

5085 msec needed at 2018-06-16 16:05